(54) EDITING DEVICE AND EDITING METHOD
(57)Abstract: 

PROBLEM TO BE SOLVED: To simplify editing of audio data in a variable-length
compression format such as MP3.

SOLUTION: This device converts audio data in a 1st data format based on a
variable-length compression system (MP3) into audio data in a 2nd data format based on a
fixed-length compression system (ATRAC3), performs editing processing on the audio data
of the 2nd data format, and subsequently returns the audio data of the 2nd data format
to audio data of the 1st data format.



* NOTICES * 

JPO and INPIT are not responsible for any 
damages caused by the use of this translation. 

1. This document has been translated by computer, so the translation may not reflect
the original precisely.

2. **** shows a word which cannot be translated.
3. In the drawings, any words are not translated.



CLAIMS 



[Claim(s)] 

[Claim 1]An editing device comprising:

A conversion means which converts audio information of a 1st data form based on
variable-length compression technology into audio information of a 2nd data form
based on fixed-length compression technology;

An editing means which can perform editing processing on the audio information of said
2nd data form; and

A conversion means which converts the audio information of said 2nd data form edited by
said editing means into audio information of said 1st data form.

[Claim 2]The editing device according to claim 1, wherein said 1st data form is the 
MPEG audio layer 3. 

[Claim 3]The editing device according to claim 1, wherein said 2nd data form is 
ATRAC3. 

[Claim 4]An editing method comprising: converting audio information of a 1st data form
based on variable-length compression technology into audio information of a 2nd data
form based on fixed-length compression technology; performing editing processing on the
converted audio information of said 2nd data form; and converting the edited audio
information of said 2nd data form into audio information of said 1st data form.

[Claim 5]The editing method according to claim 4, wherein said 1st data form is the
MPEG audio layer 3.

[Claim 6]The editing method according to claim 4, wherein said 2nd data form is 
ATRAC3. 



DETAILED DESCRIPTION 



[Detailed Description of the Invention] 
[0001] 

[Field of the Invention]This invention relates to an editing device and an editing
method suitable for audio information in a data form based on variable-length
compression technology, such as MPEG audio layer 3.
[0002] 

[Description of the Prior Art]Technology for recording and reproducing music files in
the data format called MPEG audio layer 3 (hereinafter MP3) is known. MP3 is a data
format which can compress the data volume to approximately 1/11 with little
degradation in sound quality compared with linear PCM audio information (hereinafter
WAVE data) of, for example, 16-bit quantization at a sampling frequency of 44.1 kHz
(or 48 kHz, etc.). The ATRAC3 method, developed as a compressed data format for audio
information from the ATRAC (Adaptive Transform Acoustic Coding) method used in the
MiniDisc (MD) system, is also known.
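
As a rough check of the "approximately 1/11" figure, the arithmetic can be sketched as
follows. The two-channel layout and the 128 kbit/s MP3 bit rate are assumptions chosen
for illustration; neither value is stated in the text above.

```python
# Rough check of the "approximately 1/11" compression figure.
SAMPLE_RATE_HZ = 44_100      # sampling frequency from the text
BITS_PER_SAMPLE = 16         # 16-bit quantization from the text
CHANNELS = 2                 # assumption: stereo
MP3_BITRATE_BPS = 128_000    # assumption: a commonly used MP3 bit rate

# Bit rate of the uncompressed linear PCM (WAVE) stream.
pcm_bitrate_bps = SAMPLE_RATE_HZ * BITS_PER_SAMPLE * CHANNELS

# How many times smaller the compressed stream is under these assumptions.
ratio = pcm_bitrate_bps / MP3_BITRATE_BPS

print(pcm_bitrate_bps)   # 1411200
print(round(ratio, 1))   # 11.0
```

Under these assumed values the ratio comes out close to 11, which is consistent with
the figure quoted above.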
[0003] 

[Problem(s) to be Solved by the Invention]MP3 is a data format developed mainly for
music distribution, and editing of data, the unit of writing to a recording medium, and
the like are not given much consideration. As for the audio unit serving as one unit of
audio information, the data of 1152 samples is not necessarily compressed to a fixed
length, depending on the mode. That is, variable-length data is permitted.

[0004] Drawing 9 shows an example of the data structure of MP3. In MP3, as illustrated,
a synchronization signal sy and additional information add are placed at the head of
each frame (#1, #2 ...), and main information (DT1, DT2 ...), i.e., the audio
information itself, is placed after them. A 9-bit main information starting position
field is placed at the head of each main information and describes where that main
information starts. Main information is decoded with reference to this starting
position information.
[0005] In the example of drawing 9 (a), main information DT1 has a comparatively small
data volume and is therefore completed within frame #1. Main information DT2 also has a
small data volume and is likewise completed within frame #1. Main information DT3 is
large, so it is placed from just after main information DT2 in frame #1 across frames
#2 and #3. Main information DT4 is placed in frame #4. With such a data structure, if
main information DT2 is deleted, for example, all the main information that follows
(DT3, DT4) must be rewritten so that it is shifted forward. That is, the data is
rewritten as in drawing 9 (b). For audio information with a comparatively large amount
of data, rewriting of several tens of MBytes is needed in the worst case, so merely
deleting one main information places a very large burden on the software or hardware
which performs the processing.
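
The cost described above can be illustrated with a small sketch. It models a tightly
packed stream in which variable-length items follow one another without padding, so
deleting one item forces every later byte to be rewritten at a new offset. The function
name `bytes_rewritten_on_delete` and the item sizes are hypothetical, and the model is
a simplification, not the MP3 bitstream layout itself.

```python
from typing import List

def bytes_rewritten_on_delete(item_sizes: List[int], index: int) -> int:
    """Bytes that must be rewritten when item `index` is removed from a
    tightly packed stream: everything stored after it shifts forward."""
    if not 0 <= index < len(item_sizes):
        raise IndexError("index out of range")
    return sum(item_sizes[index + 1:])

# Hypothetical sizes in bytes for DT1..DT4, loosely following drawing 9.
sizes = [300, 200, 900, 400]

# Deleting DT2 (index 1) forces DT3 and DT4 to move.
print(bytes_rewritten_on_delete(sizes, 1))  # 1300

# Deleting the last item moves nothing.
print(bytes_rewritten_on_delete(sizes, 3))  # 0
```

The amount rewritten grows with everything stored after the deleted item, which is why
the worst case for a large collection reaches tens of MBytes.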

[0006] Editing such as dividing one main information (one piece of music) at a certain
point into two pieces of music is also troublesome, because frame boundaries do not
correspond to the edit unit of the sound, so the edit point must be determined anew
each time. That is, in the 44.1 kHz mode of MP3, one frame is a unit of 1152/44100
seconds, and the edit point must be found from within it. The same naturally applies to
finding the boundary between pieces of music as the edit point in the deletion
described above. It also applies to editing that connects two pieces of music into one,
since the boundary between the pieces must be found. If padding to a multiple of 8
bytes, as required by encryption such as DES, is also taken into account, the editing
processing of MP3 becomes still more troublesome.

[0007] Thus, audio information in MP3 form cannot be said to be suitable for editing
processing such as deletion or division of main information (music). On the other hand,
MP3, which maintains high sound quality while compressing the data to a sufficiently
small size, is regarded as a very useful data format and is in wide general use, so
users wish to edit audio information recorded in MP3 more easily.
[0008] 

[Means for Solving the Problem]In view of this situation, an object of this invention
is to provide an editing device and an editing method which make it easy to edit audio
information based on variable-length compression technology such as MP3.
[0009] To this end, this invention has a conversion means which converts audio
information of a 1st data form based on variable-length compression technology into
audio information of a 2nd data form based on fixed-length compression technology, an
editing means which can perform editing processing on the audio information of the 2nd
data form, and a conversion means which converts the audio information of the 2nd data
form edited by the editing means into audio information of the 1st data form. For
example, the 1st data form is MPEG audio layer 3 (MP3), and the 2nd data form is
ATRAC3.

[0010] Editing processing is easy for audio information of the 2nd data form based on
fixed-length compression technology, for example ATRAC3. Therefore, if data of the 1st
data form is converted into the 2nd data form, edited, and then returned to the 1st
data form, audio information of the 1st data form can be edited easily as a result.
[0011] 

[Embodiment of the Invention] Hereafter, an editing device and an editing method
according to an embodiment of the invention are described using drawing 1 - drawing 8.
In this embodiment, a personal computer is used as the editing device. The 1st data
form is MP3 and the 2nd data form is ATRAC3. Explanation is given in the following
order.

1. Data structure of ATRAC3
2. Composition of personal computer used as editing device
3. Editing processing
4. Edit of ATRAC3

[0012] 1. Data structure of ATRAC3
The data structure of ATRAC3 is shown in drawing 1. In ATRAC3, as illustrated, a header
HD and additional information add are placed at the head of each frame (#1, #2 ...),
and main information (DT1, DT2 ...), i.e., the audio information itself, is placed
after them.

[0013] Each main information has a fixed length. The frame unit is 1024/44100 seconds.
In drawing 1, for the sake of explanation, each main information DT1, DT2 ... is drawn
with a capacity equivalent to one frame, but the main information, i.e., the data
volume of one piece of music, is an integral multiple of the frame unit. Since a frame
boundary is also a boundary in the sound information, editing can be performed in frame
units. That is, unlike the case of MP3 explained above, it is unnecessary to find an
edit point within a frame for deletion, division, or combination of music.
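
The frame durations quoted for the two formats, and the idea of placing an edit point
on a frame boundary, can be sketched as follows. The helper names `frame_ms` and
`nearest_frame_boundary` are hypothetical and only illustrate the arithmetic.

```python
from fractions import Fraction

SAMPLE_RATE_HZ = 44_100
MP3_FRAME_SAMPLES = 1152      # MP3 frame at 44.1 kHz, per the text
ATRAC3_FRAME_SAMPLES = 1024   # ATRAC3 frame unit, per the text

def frame_ms(samples: int) -> float:
    """Duration of one frame in milliseconds."""
    return float(Fraction(samples, SAMPLE_RATE_HZ) * 1000)

def nearest_frame_boundary(t_seconds: float) -> int:
    """Index of the ATRAC3 frame boundary closest to a requested time."""
    return round(t_seconds * SAMPLE_RATE_HZ / ATRAC3_FRAME_SAMPLES)

print(round(frame_ms(MP3_FRAME_SAMPLES), 2))     # 26.12
print(round(frame_ms(ATRAC3_FRAME_SAMPLES), 2))  # 23.22
print(nearest_frame_boundary(10.0))              # 431
```

Because every frame has the same size, an edit point chosen this way maps directly to a
byte offset, with no search inside a frame.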

[0014] 2. Composition of personal computer used as editing device
[0015] The composition of the personal computer 1 of this example used as an editing
device is shown in drawing 2. The personal computer 1 is provided with a CPU 11, a RAM
12, a ROM 13, a hard disk drive (HDD) 14, and so on, and overall operation control is
performed by the CPU 11. The RAM 12 provides the data area and task area used when the
CPU 11 executes a program, and is where application programs stored in the HDD 14 and
the like are loaded. The ROM 13 stores the operating program and other information
required by the personal computer. The HDD 14, as a mass storage device, stores various
programs and data files. In this example, audio files in MP3 form acquired from outside
are also stored in the HDD 14. File data and control data are transferred between the
blocks via the bus B.
[0016] The input device 2 represents various devices for user operation, for example a
keyboard and a mouse, and operation input information from the input device 2 is taken
into the CPU 11 via the input interface 15. The display 3 is a monitor display device
such as a CRT or a liquid crystal display, and the monitor display is performed by the
display driving part 17 supplying a video signal based on instructions from the CPU 11.

[0017] In this example, it is assumed that data files in MP3 form stored in the HDD 14
can be reproduced and output. For this purpose, the audio decode part 18 and the D/A
converter 19 are provided so that an audio signal can be supplied to the loudspeaker 4.
Under control of the CPU 11, the audio decode part 18 decodes the MP3 audio file read
from the HDD 14 and converts it into WAVE data, i.e., linear audio information of, for
example, 16-bit quantization and 44.1 kHz sampling. The D/A converter 19 converts the
linear audio information into an analog signal, which is amplified, supplied to the
loudspeaker 4, and output to the user.

[0018] The communication interface 16 connects to the transmission line 7, for example
an ordinary public line such as an ISDN line or some dedicated line, and performs data
communications. For example, MP3 audio files are downloaded through the Internet and
the like. The transmission line 7 may also be a wireless communication line, such as a
satellite communication line using the communications satellite 4, the parabolic
antenna 5 installed at each home 2, and the like.

[0019] The external device interface 20 performs data transmission and reception with
various external devices connected to the personal computer 1 via the connector 21.
Although only one connector 21 is shown in the drawing, two or more may be provided.
Several transmission systems are conceivable, such as IEEE1394, USB, IEC958, a serial
port, a parallel port, and SCSI. In relation to the operation of this example described
later, the MP3 recording and reproducing device 5, which can record and reproduce MP3
audio files, and the ATRAC3 recording and reproducing device 6, which can record and
reproduce ATRAC3 audio files, are connected to the connector 21, so that MP3 audio
files or ATRAC3 audio files can be transferred between these devices and the personal
computer 1.

[0020] With this composition, the personal computer 1 can acquire an MP3 audio file via
the communication interface 16 or the external device interface 20 and store it in the
HDD 14. Although not illustrated, devices such as a floppy disk drive and a CD-ROM
drive may be provided. By providing an input terminal for an analog or digital audio
signal and an MP3 encode part for the audio signal input from that terminal, the input
audio signal may be converted into an MP3 audio file and stored in the HDD 14.
Naturally, an ATRAC3 encoder and decoder may also be provided so that ATRAC3 audio
files can be recorded and reproduced.

[0021] Drawing 3 shows the functional block composition realized in the CPU 11, for
example by software, so that the personal computer 1 operates as an editing device for
MP3 audio files. Naturally, such functional blocks may also be implemented in hardware.

[0022] As shown in drawing 3, the MP3 record / reproduction driver 31 is provided
first. The MP3 record / reproduction driver 31 performs writing and reading of MP3
audio files to and from the HDD 14, transfer to the audio decode part 18, and so on.
The communication driver 32 manages reception and transmission of MP3 audio files
through the communication interface 16 and through the external device interface 20.



[0023] The MP3 / WAVE conversion function 33 converts an MP3 audio file into WAVE data
and converts WAVE data into an MP3 audio file. The WAVE/ATRAC3 conversion function 34
converts WAVE data into an ATRAC3 audio file and converts an ATRAC3 audio file into
WAVE data. The ATRAC3 edit function 35 performs editing processing on an ATRAC3 audio
file, for example deletion, division, and connection of music, and movement of the
playing order.

[0024] With these functional blocks provided, the personal computer 1 can execute the
editing processing explained below.

[0025] 3. Editing processing
The editing processing for an MP3 audio file is explained with reference to drawing 4.
Step F101 of drawing 4 is processing which acquires the specific MP3 audio file to be
edited. For example, when a user requests editing of a certain MP3 audio file stored in
the HDD 14, the CPU 11 reads that MP3 audio file from the HDD 14 with the MP3 record /
reproduction driver 31. Alternatively, with the communication driver 32, an MP3 audio
file may be acquired from the MP3 recording and reproducing device 5 via the external
device interface 20, or from a specific server via the communication interface 16.

[0026] Then, at Step F102, the CPU 11 converts the acquired MP3 audio file into WAVE
data with the MP3 / WAVE conversion function 33. Next, at Step F103, the CPU 11
converts the WAVE data into an ATRAC3 audio file with the WAVE/ATRAC3 conversion
function 34. At Step F104, editing processing according to the user's operation is
performed by the ATRAC3 edit function 35 on the ATRAC3 audio file obtained by the above
conversion. Divide (division) edit and combine (connection) edit are described later as
examples of the edit performed here.

[0027] When the editing processing requested by the user is completed, at Step F105 the
CPU 11 converts the edited ATRAC3 audio file into WAVE data with the WAVE/ATRAC3
conversion function 34. Further, at Step F106, the CPU 11 converts the WAVE data into
an MP3 audio file with the MP3 / WAVE conversion function 33. At Step F107, the MP3
audio file thus obtained is treated as the edited MP3 audio file: it is written to the
HDD 14 with the MP3 record / reproduction driver 31, or transmitted to the MP3
recording and reproducing device 5 or the like with the communication driver 32.
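
The flow of Steps F102 to F106 can be sketched as a chain of conversions around a
frame-level edit. This is a minimal sketch under stated assumptions: the function
`edit_via_fixed_length` and all helper names are hypothetical, and the toy helpers
below are stand-ins that merely regroup bytes. They are not MP3 or ATRAC3 codecs.

```python
from typing import Callable, List

Pcm = List[int]        # stand-in for WAVE (linear PCM) samples
Frames = List[Pcm]     # stand-in for fixed-length frames

def edit_via_fixed_length(
    src: bytes,
    decode_src: Callable[[bytes], Pcm],      # F102: 1st format -> WAVE
    encode_fixed: Callable[[Pcm], Frames],   # F103: WAVE -> 2nd format
    edit: Callable[[Frames], Frames],        # F104: edit in frame units
    decode_fixed: Callable[[Frames], Pcm],   # F105: 2nd format -> WAVE
    encode_src: Callable[[Pcm], bytes],      # F106: WAVE -> 1st format
) -> bytes:
    pcm = decode_src(src)
    frames = edit(encode_fixed(pcm))
    return encode_src(decode_fixed(frames))

# Toy stand-ins so the sketch runs; they are NOT real codecs.
FRAME = 4

def to_pcm(data: bytes) -> Pcm:
    return list(data)

def to_bytes(pcm: Pcm) -> bytes:
    return bytes(pcm)

def to_frames(pcm: Pcm) -> Frames:
    return [pcm[i:i + FRAME] for i in range(0, len(pcm), FRAME)]

def from_frames(frames: Frames) -> Pcm:
    return [sample for frame in frames for sample in frame]

def delete_second_frame(frames: Frames) -> Frames:
    return frames[:1] + frames[2:]

result = edit_via_fixed_length(
    bytes(range(12)), to_pcm, to_frames,
    delete_second_frame, from_frames, to_bytes,
)
print(list(result))  # [0, 1, 2, 3, 8, 9, 10, 11]
```

Passing the conversions in as callables keeps the edit step independent of either
codec, which mirrors the separation between functions 33, 34, and 35.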
[0028] In this example, as described above, an MP3 audio file is first converted into
an ATRAC3 audio file, the necessary editing processing is performed, and the ATRAC3
audio file resulting from the edit is returned to an MP3 audio file, so that the MP3
audio file is edited as a result. As mentioned above, ATRAC3 is a data format in which
editing processing is very easy, so by converting the MP3 data file into an ATRAC3 data
file and then performing the editing processing, editing of the MP3 data file can be
realized easily. In general, software on a personal computer can freely use
sufficiently high-precision floating-point arithmetic, so the degradation in sound
quality from conversion by the MP3 / WAVE conversion function 33 and the WAVE/ATRAC3
conversion function 34 can be kept to a minimum.
[0029] In the processing of drawing 4, at the stage of the ATRAC3 data file
(immediately before or immediately after editing), the ATRAC3 data file may be
transmitted to the ATRAC3 recording and reproducing device 6 via the external device
interface 20 and recorded on the recording medium in the ATRAC3 recording and
reproducing device 6. In this case, an MP3 audio file can be used with the ATRAC3
recording and reproducing device 6. For example, if the ATRAC3 recording and
reproducing device 6 records to and reproduces from a memory card or the like
containing flash memory, an MP3 audio file stored in the HDD 14 can be converted into
an ATRAC3 audio file and recorded on the memory card, and music can then be reproduced
from the memory card.
[0030] 4. Edit of ATRAC3
An example of the editing processing on an ATRAC3 audio file performed at Step F104 of
drawing 4 above is explained.

[0031] An ATRAC3 audio file (hereinafter simply an audio file) as a file for one piece
of music is shown in drawing 5. This audio file consists of an attribute header at the
head and the encrypted actual music data following it. The attribute header has a fixed
length of 16 KB. The attribute header at the head of the audio file consists of a
header, a track name NM1 in single-byte code, a track name NM2 in two-byte code, track
information TRKINF such as key information on the track, part information PRTINF, and
additional information INF of the track. The header includes the total number of parts,
the attribute of the name, the size of the additional information, and so on.

[0032] In this audio file, the ATRAC3 music data follows the attribute header. The
music data is divided into blocks of 16 KB each, and a header is added to the head of
each block.

[0033] With reference to drawing 6, the relation between music and an ATRAC3 data file
is explained. One piece of music consists of one ATRAC3 data file (refer to drawing 5).
Audio information compressed by ATRAC3 is recorded in the ATRAC3 data file.

[0034] Although one piece of music basically consists of one part, after editing it may
consist of two or more parts. A part means a unit of data recorded within a continuous
time from the start of recording to its stop. When one piece of music consists of two
or more parts, the relation of the parts within the music is managed by the part
information PRTINF in the attribute header of each piece of music. That is, part
information PRTINF describes the total number of clusters a part has, the position of
the start sound unit (hereinafter SU) in the head cluster, the position of the end SU
in the last cluster, and so on. A cluster here is a unit of 16 KBytes equivalent to the
block of drawing 5. With this way of describing parts, the movement of large amounts of
music data that would ordinarily be needed when editing music data can be eliminated.
If editing were limited to block units, movement of music data could be avoided
similarly, but a block is too large an edit unit compared with an SU.

[0035] An SU is the minimum unit of a part and is the minimum data unit when audio
information is compressed by ATRAC3. An SU is several hundred bytes of data obtained by
compressing to about 1/10 the audio information for 1024 samples (1024 x 16 bits x two
channels) obtained at a 44.1 kHz sampling frequency. Converted into time, 1 SU is about
23 ms. Usually, one part consists of several thousand SUs. When one cluster consists of
42 SUs, one cluster can represent about 1 second of sound. The number of parts which
constitute one piece of music is affected by the size of the additional information.
Since the number of parts is determined by what remains of one block after excluding
the header, the track name, the additional information data, and so on, the maximum
number of parts (645) can be used when there is no additional information.
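
The SU figures above follow from simple arithmetic, which can be checked with the
sketch below. All constants are taken from the paragraph above; the variable names are
hypothetical, and the exact compression ratio of 10 stands in for "about 1/10".

```python
SAMPLE_RATE_HZ = 44_100
SAMPLES_PER_SU = 1024
BITS_PER_SAMPLE = 16
CHANNELS = 2
COMPRESSION = 10          # stands in for "about 1/10" in the text
SU_PER_CLUSTER = 42       # example figure from the text

# Uncompressed size of the audio covered by one SU, in bytes.
raw_bytes = SAMPLES_PER_SU * BITS_PER_SAMPLE * CHANNELS // 8

# Approximate compressed size of one SU ("several hundred bytes").
su_bytes = raw_bytes / COMPRESSION

# Playback time of one SU in milliseconds and of one cluster in seconds.
su_ms = SAMPLES_PER_SU / SAMPLE_RATE_HZ * 1000
cluster_s = SU_PER_CLUSTER * SAMPLES_PER_SU / SAMPLE_RATE_HZ

print(raw_bytes)            # 4096
print(round(su_bytes))      # 410
print(round(su_ms, 1))      # 23.2
print(round(cluster_s, 2))  # 0.98
```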

[0036] Drawing 6 shows an example in which audio information recorded on a CD or the
like is recorded on a certain recording medium as two consecutive pieces of music in
ATRAC3 audio files. It shows the case where the 1st piece of music (audio file #1)
consists of five clusters (CL0-CL4) as in drawing 6 (a) and the 2nd piece of music
(audio file #2) consists of six clusters (CL5-CL10) as in drawing 6 (c). Since two
files are not allowed to coexist in one cluster, audio file #2 is created from the
beginning of the next cluster (CL5). Therefore, even if the end of audio file #1 (the
end of the 1st piece of music) falls in the middle of a cluster, no data (SU) exists in
the remaining portion of that cluster, as shown enlarged in drawing 6 (b). The same
applies to the 2nd piece of music (audio file #2). In this example, audio files #1 and
#2 each consist of one part.

[0037] For an ATRAC3 audio file of this form, divide (division of music), combine
(connection of music), deletion (deletion of music), move (movement of the playing
order), and so on are possible as edits. Divide splits one piece of music into two.
When a divide is performed, the total number of pieces of music increases by one. A
divide splits one file on the file system into two files. Combine joins two pieces of
music into one. When a combine is performed, the total number of pieces of music
decreases by one. A combine unifies two files on the file system into one file.
Deletion erases a piece of music. After the erasure, the number of pieces of music
decreases by one.
[0038] The result of combining the two pieces of music (audio files #1 and #2) shown in
drawing 6 is shown in drawing 7. As a result of the combine, audio files #1 and #2
become one audio file #1, and this audio file #1 is formed from two parts. Since parts
are described in this example as mentioned above, the starting position of part 1, the
end position of part 1, the starting position of part 2, and the end position of part 2
can each be specified in SU units for the combined result (drawing 7). As a result, it
is not necessary to move the music data of part 2 in order to close the gap at the
joint of the combined result.
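
A combine that only updates part descriptions can be sketched as follows. The `Part`
and `AudioFile` types, their field names, and the SU positions are hypothetical
simplifications for illustration; they are not the actual PRTINF layout.

```python
from dataclasses import dataclass
from typing import List, Tuple

@dataclass(frozen=True)
class Part:
    clusters: Tuple[int, ...]  # cluster numbers in playback order
    start_su: int              # first valid SU in the head cluster
    end_su: int                # last valid SU in the final cluster

@dataclass
class AudioFile:
    parts: List[Part]

def combine(first: AudioFile, second: AudioFile) -> AudioFile:
    """Join two files by concatenating their part lists; no SU data moves."""
    return AudioFile(parts=first.parts + second.parts)

# Audio file #1 uses CL0-CL4 and #2 uses CL5-CL10, as in drawing 6.
# The end SU positions (30 and 17) are made-up example values.
file1 = AudioFile([Part((0, 1, 2, 3, 4), start_su=0, end_su=30)])
file2 = AudioFile([Part((5, 6, 7, 8, 9, 10), start_su=0, end_su=17)])

merged = combine(file1, file2)
print(len(merged.parts))          # 2
print(merged.parts[0].end_su)     # 30
print(merged.parts[1].clusters)   # (5, 6, 7, 8, 9, 10)
```

The unused tail of cluster CL4 stays where it is; the end SU recorded for part 1 tells
a player where to jump to part 2.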
[0039] Drawing 8 shows the result of dividing one piece of music (audio file #1 of
drawing 6 (a)) in the middle of cluster CL2. Naturally, the division point is a frame
boundary as shown in drawing 1. The divide produces data file #1, consisting of
clusters CL0 and CL1 and the front part of cluster CL2, and data file #2, consisting of
the rear part of cluster CL2 (CL11) and clusters CL3 and CL4. Since two files are not
allowed to coexist in one cluster as mentioned above, in a divide edit whose division
point lies within cluster CL2, the data of cluster CL2 is first copied to another, free
cluster CL11. The position corresponding to the division point in cluster CL11 is then
made the starting point, and clusters CL3 and CL4 are made to follow cluster CL11 in
data file #2. Since parts are described as mentioned above, in the divide result
(drawing 8) it is not necessary to move data so as to fill the gap at the head (cluster
CL11) of audio file #2.
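
The divide described above can be sketched in the same style. The `Part` type, the
`divide` function, and the SU positions are hypothetical; the copy of the shared
cluster is modeled only by substituting the number of the free cluster, and the sketch
assumes the division point lies after the first SU of the chosen cluster.

```python
from dataclasses import dataclass
from typing import Tuple

@dataclass(frozen=True)
class Part:
    clusters: Tuple[int, ...]  # cluster numbers in playback order
    start_su: int              # first valid SU in the head cluster
    end_su: int                # last valid SU in the final cluster

def divide(part: Part, cluster: int, su: int,
           free_cluster: int) -> Tuple[Part, Part]:
    """Split `part` so the second file begins at SU `su` of `cluster`.
    The shared cluster is duplicated into `free_cluster`, so no two
    files ever share a cluster and no other data has to move."""
    if su < 1:
        raise ValueError("division point must lie inside the cluster")
    i = part.clusters.index(cluster)
    front = Part(part.clusters[:i + 1], part.start_su, su - 1)
    back = Part((free_cluster,) + part.clusters[i + 1:], su, part.end_su)
    return front, back

# Audio file #1 of drawing 6 (a): clusters CL0-CL4. The SU numbers
# (division at SU 20, last SU 30) are made-up example values.
original = Part((0, 1, 2, 3, 4), start_su=0, end_su=30)
file1, file2 = divide(original, cluster=2, su=20, free_cluster=11)

print(file1.clusters, file1.end_su)    # (0, 1, 2) 19
print(file2.clusters, file2.start_su)  # (11, 3, 4) 20
```

Only one cluster is copied regardless of how long the music is, in contrast to the
forward shift of all following data needed in the MP3 case.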

[0040] The composition and processing operation of the personal computer 1 serving as
the editing device of this invention have been explained above as an embodiment, but
various modifications of the editing device and editing method of this invention are
conceivable. For example, the editing device of this invention can be realized not only
in a form like a personal computer but also as a stationary or portable audio recording
and reproducing device. Although editing was performed by converting an MP3 audio file
into ATRAC3, editing may instead be performed by converting into ATRAC, ATRAC2, or the
like. Naturally, other data forms suitable for editing may also be used.
[0041] 

[Effect of the Invention]As explained above, in this invention, audio information of
the 1st data form based on variable-length compression technology is converted into
audio information of the 2nd data form based on fixed-length compression technology,
editing processing is performed on the audio information of the 2nd data form, and the
result is then returned to audio information of the 1st data form. This has the effect
that audio information of the 1st data form, such as MP3, can be edited without a large
processing burden.



DESCRIPTION OF DRAWINGS 



[Brief Description of the Drawings] 

[Drawing 1] It is an explanatory view of the ATRAC3 data structure used in an
embodiment of the invention.



[Drawing 2] It is a block diagram of the personal computer used as the editing device of 
an embodiment. 

[Drawing 3] It is an explanatory view of the functional block in CPU of the personal 
computer of an embodiment. 

[Drawing 4] It is a flow chart of the editing processing of an embodiment. 
[Drawing 5] It is an explanatory view of the structure of ATRAC3 audio file. 
[Drawing 6] It is an explanatory view of the part composition of ATRAC3 audio file. 
[Drawing 7] It is an explanatory view of combine edit of ATRAC3 audio file. 
[Drawing 8] It is an explanatory view of DEBAIDO edit of ATRAC3 audio file. 
[Drawing 9] It is an explanatory view of the data structure of MP3. 
[Description of Notations] 

1 personal computer, 2 input device, 3 display, 4 loudspeaker, 5 MP3 recording and
reproducing device, 6 ATRAC3 recording and reproducing device, 7 transmission line,
11 CPU, 12 RAM, 13 ROM, 14 HDD, 15 input interface, 16 communication interface,
17 display driving part, 18 audio decode part, 19 D/A converter, 20 external device
interface, 21 connector, 31 MP3 record / reproduction driver, 32 communication driver,
33 MP3 / WAVE conversion function, 34 WAVE/ATRAC3 conversion function, 35 ATRAC3 edit
function



